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" PRODUCT I ON OF GL YCOS YLPHOSPHATI DYL I NOS I TOL - ANCHORED 

RECOMBINANT PROTEINS" 



The present invention relates to a process for ex- 
pressing recombinant proteins , such as human insulin or 
5 Mycobacterium leprae ISkDa antigen, in genetically modified 
eukariotic microorganisms, especially in Saccharomyces 
cerevisiae yeast, in a form anchored to the plasma membrane by 
means of a glycosylphosphatidylinositol (GPI) anchor, and for 
selectively releasing them by methods making use of intrinsic 

10 properties provided by the presence of GPI, e.g. by treatment 
with GPI-specific phospholipase. 

The purification of proteins of medical interest for 
use in diagnosis or treatment initially involves the large- 
scale obtention of the material from which the protein will be 

15 isolated. In many cases, this has become unfeasible for se- 
veral reasons, such as costs and availability of the raw mate- 
rials, these being in many cases human organs, which implies 
potential problems of contamination of the material with virus 
that cannot be totally eliminated during the purification 

20 process. For these reasons, in producing these proteins empha- 
sis has been placed on the obtention of modified 
microorganisms that can express the products of interest. For 
instance, bacteria such as E.coli can be modified in such a 
manner that they become capable of synthesizing human proteins 

25 in large amounts. Yeasts such as S.cerevisiae and cells of 
mammals maintained in culture can also be altered in order to 
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produce the peptides of interest. The- advantages in these 
cases are the low cost of the raw material, the 
reproducibility in the obtention of large amounts of the prod- 
uct and the harralessness of the systems employed. Another 
5 great advantage of these systems is the possibility of alter- 
ing the sequence of the proteins being expressed, in order to 
adapt them to the needs of the product; these alterations, in- 
volving residues of certain amino acids, in theory can result 
in the obtention of products that are more specific, have a 

10 higher activity or exhibit additional functions not presented 
by the original product. For instance, the usually employed 
insulin is obtained from the pancreas of oxen or pigs. How- 
ever, these animal insulins differ from human insulin by 3 and 
1 amino acid residues, respectively, and this minor variation 

15 accounts for undesired immunological reactions. Therefore, the 
large-scale production of recombinant human insulin represents 
a very important step in the therapy of diabetes. Another im- 
portant advantage of the obtention of polypeptides by 
recombinant DNA technology is the possibility of modifying the 

20 product in order to facilitate its purification. For instance, 
additional residues that can be recognized and linked to a 
specific substance such as an antibody, lecitins or protein A, 
can be introduced in the recombinant protein, thus facilitat- 
ing the large-scale purification of the product by affinity. 

25 The yeast Saccharomyces cerevisiae has become an im- 

portant tool in the production of recombinant proteins and is 
often preferred to E.coli since the latter has toxic sub- 
stances associated with its wall, which implies extensive pu- 
rification processes and tests of the products synthesized in 

30 this bacterium. The yeast, on the other hand, has been used 
for centuries in the manufacture of foodstuff and is harmless 
to human beings • 

The recombinant proteins can be produced by these 
hosts and remain inside the cells. Alternatively, these prote- 

35 ins can be secreted into the culture medium. Each method has 
its advantage and, depending on the type of protein to be ex- 
pressed, one or the other is chosen. In general, for proteins 
which are usually secreted by the cells that express them na- 
turally, the secretion way is chosen for their production on a 
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large scale. Naturally intracellular proteins are usually 
produced in these organisms also in the intracellular form. 
However, although mimicking the native location, in many cases 
and for reasons which cannot be determined beforehand, one 
5 cannot succeed in producing them on a large-scale and it is 
necessary to test several forms of biosynthesis. Many prote- 
ins, when expressed intracellularly in large amounts, form 
inclusion bodies constituted by proteins aggregated in an in- 
active manner. Although the purification of these proteins is 

10 facilitated by their being in an insoluble form, their use is 
limited because this requires steps of denaturation and 
renaturation in vitro, which do not always allow the obtention 
of proteins with their natural and active conformation and, 
besides, result in low yields of the correctly folded product. 

15 The process of purifying the recombinant product 

also defines the type of expression of the protein. Thus, for 
instance, secreted proteins will be in a diluted form in the 
culture medium, which should be processed after elimination of 
the cells by centrif ugation or filtration. This implies a con- 

20 centration and purification of the product starting from hun- 
dreds or thousands of litres, requiring quite sophisticated 
and complex procedures. On the other hand, few proteins are 
secreted normally, for which reason the product of interest 
will comprise a large portion of the total protein of a 

25 supernatant of these cultures. Intracellularly produced pro- 
teins are concentrated by merely collecting the cells. In 
this case, the protein of interest will constitute a small 
amount of the total protein and, therefore, a number of pu- 
rification steps are involved. 

30 The present invention provides an alternative way of 

expressing recombinant proteins in S.cerevisiae, by which the 
recombinant product will be associated with the plasma mem- 
brane by means of a GPI anchor. The use of this method for ex- 
pressing human insulin and the M. leprae 18kD antigen is 

35 exemplified hereinbelow. However, this technology is not lim- 
ited to these examples. Other polypeptides can also be ex- 
pressed in this form. The advantages in using this technology 
will be apparent from the following description. 

All the eukaryotic organisms have proteins that are 
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normally associated with the plasma membrane by means of 
glycosylphosphatidylinositol (GPI) covalently bound to the C- 
terminal amino acid of the polypeptide chain. GPI, which is a 
structure quite well conserved throughout the evolutionary 
5 scale, contains ethanolamine, phosphate, mannose, occasionally 
galactose and necessarily a non-acetylated glucosamine, which 
is in glycosidic linkage with the inositol ring of a 
phosphatidylinositol molecule (Ferguson & Williams, 1989, Ann. 
Rev. Biochem. 59:285; Thomas et al., 1990, Biochemistry 29 

10 :5413). This structure is co-translationally added at the 
level of the endoplasmic reticulum in a process of which the 
rapidity suggests that the nascent protein receives the al- 
ready preformed GPI anchor by means of enzymatic steps not yet 
totally characterized (Doering et al., 1989, J. Biol. Chem. 

15 265:611). The precursors of the GPI-anchored proteins contain 
a hydrophobic sequence of from 15 to 30 amino acids at the 
C-terminal, which is eliminated during the processing, gener- 
ating a new C-terminus, to which the GPI structure is added. 
Thus, both transmembranic and secreted proteins can be con- 

20 verted into their respective GPI-anchored variants by adding 
this signal sequence to their respective biosynthetic precur- 
sors (Caras et al, 1987, Science 238:1280; Crise et al., 1989, 
J. Virol. 63 :5328; Tykocinsky et al., 1988, Proc. Natl. 
Acad. Sci. USA 85 :3555). Although there is no absolute con- 

25 sensus regarding the primary structure of the terminal 
peptide, the analysis of the sequences that flank the site of 
addition of GPI indicates that three aspects seem important in 
the process, in both mammals and protozoa: a hydrophobic se- ' 
guence of a minimum size, the absence of possible 

30 cytoplasmatic domains and a somewhat indefinite pattern of 
amino acid sequences or conformation around the site of 
cleavage (Low, 1990, Biochim, Biophys. Acta 988 427; Ferguson 
& Williams, 1989, op. cit.; Caras & Weddell, 1989, Science 243 
:1196) . 

35 The GPI-anchored proteins are usually released from 

the membrane when treated with inositol-specif ic 
phospholipase, which treatment, upon hydrolysing 

phosphatidylinositol, breaks the domain of interaction with 
the membrane (Low, 1990, op.cit.). Several types of 
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phospholipase C (PLC) and D capable of . solubilizing 
GPI-containing molecules in vitro have been described* Some of 
these types, such as the phospholipases C of bacterial origin, 
have a broad specificity and hydrolyse phosphatidylinositol 
5 and GPI, while others, from mammals or trypanosomes , seem to 
be highly specific for GPI (Low, 1990, op. cit.)* their 
native condition GPI-containing glycoproteins exhibit an 
amphiphilic character and, after solubilization by PLC, ac- 
quire a hydrophilic character (Ferguson & Williams, op. cit.). 

10 Based on this difference of behaviour, easily induced by 
enzymatic treatment, several purification techniques based on 
phase separation in Triton X-114 (Bordier, 1981, J. Biol. 
Chem. 256 :1604; Cardoso de Almeida & Turner, 1983, Nature 
302 :349, and charge displacement electrophoresis, Toutant et 

15 al., 1989, Eur. J. Biochem. 180:503) have been employed for 
obtaining and characterizing glycoproteins with this post- 
translational modification. 

Besides the in vitro enzymatic treatment, the re- 
lease of GPI-anchored proteins can also be mediated in vivo by 

20 means of the co-expression of phospholipases (Scallon et al., 
1992, Bio/Technology 10:500). The genes coding for some 
phospholipases, including GPI-specific phospholipase C, have 
already been cloned and sequenced; the expression of these ac- 
tivities in cells of interest is, therefore, possible. 

25 Concomitantly with the solubilization by PLC the re- 

leased protein exposes inositol 1,2-cyclic phosphate in the 
structure of its hydrolysed GPI, which is a structural marker 
known as "cross-reacting determinant" (CRD), and an epitope 
classically recognized as an antibody (anti-CRD). This 

30 polyclonal antibody has the property of discriminating the en- 
tire structure of GPI from the structure hydrolysed by 
phospholipases C and further allows the selective purification 
of GPI-containing molecules by immunoaf f inity . (Cardoso de 
Almeida & Turner, 1983, op. cit.; Zamze et al., 1988, Eur. J. 

35 Biochem. 176 :527). 

Consequently, the GPI moiety of an anchored protein 
can also be obtained by chemical or enzymatic treatments, and 
free GPI's can be purified by selective extraction with or- 
ganic solvents (Orlean et al, 1994, Brazilian J. Med. Biol. 
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Res. Vol. 27, page 145). 

In S. cerevisiae only one protein is predominantly 
solubilized with phospholipase C and visualized with anti-CRD 
antibody (Conzelmann et al., 1988, EMBO J. 7 :2233). The 
5 function of this 125 kDa protein, called Gasl, is still un- 
known. The molecular characterization of the gene which codes 
for it (GASl) indicates that this protein is not necessary for 
the normal growth of the yeast. 

In view of the foregoing, any protein can receive a 

10 GPI-anchor and consequently remain linked to the plasma mem- 
brane, if the appropriate signal sequence is added to its 
carboxyl terminus. As it will be exemplified hereinafter, in 
S.cerevisiae lacking the Gasl endogenous protein, the 
recombinant product containing the GPI-anchor will practically 

15 be the only protein to be recognized and cleaved by 
phospholipase. Therefore, this method represents a unique ad- 
vantage of releasing the product in a highly specific and se- 
lective manner. In addition, the recombinant product will 
contain an epitope (CRD) which can be used in its final pu- 

20 rification by immunoaf f inity . 

Human insulin, commercially available for the treat- 
ment of insulin-dependent diabetes is derived, at present, 
from three alternative sources, namely: a) chemically modified 
animal insulin (Markussen, J., 1980, US Patent 3,433,898); b) 

25 recombinant human insulin, produced in Saccharomyces 
cerevisiae in secreted form (Thim et al., 1987, FEBS Letters 
212 :307), and c) recombinant human insulin produced in E. 
coll in the form of intracellular pro-insulin, which after pu- 
rification is enzymatically cleaved with tripsin and 

30 carboxypeptidase B, providing human insulin (Williams et al., 
1982, Science 215:687; Sung et al., 1986, Proc. Natl. Acad. 
Sci., 83:561). The present invention is exemplified in this 
alternative form of expressing human insulin in the yeast 
S. cerevisiae, in which the insulin molecule will be originally 

35 anchored via GPI to the cellular membrane, which allows the 
facilitated purification of the product by means of the treat- 
ment of the cells with GPI-specific phospholipase C. 

The Mycobacterium leprae 18kDa protein is one of the 
main targets of the cellular immune response in individuals 
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infected by this bacillus (Booth et al., 1988, J. Immunol. 
140:597). M. leprae is at present a problem of underdeveloped 
and developing countries, causing leprosy in approximately 15 
million people (WHO Bulletin, 1988, World Health 
5 Organization/Switzerland) . Due to the difficulty in cultivat- 
ing it, little is known about the biology, biochemistry or 
immunology of this disease. It was suggested that the ISkDa 
protein might represent a potential vaccine against this dis- 
ease (Young et al., 1988, Proc. Natl. Acad. Sci. USA 85:4267). 

10 With a view to making a indepth study of the cellular and hu- 
moral immune response to this antigen, many attempts have been 
made to obtain its expression and purification on a large 
scale in £• coll, but without success due to the formation of 
insoluble aggregates, difficulty in the purification and 

15 proteolytic cleavage of the recombinant protein. The problem 
was appearently solved by its expression in S. cerevisiae, in 
both intracellular and extracellular form (Booth et al., 1988, 
Immunol. Lett. 19 :65; Piestun, 199^2, Master Dissertation, 
Dept. Immunol., ICB, YSO) . The present invention represents an 

20 alternative for obtaining a H. leprae 18kDa protein in 
membrane-anchored form in S. cerevisiae. This membrane- 
anchored form can offer advantages over the soluble forms of 
this protein, or of other antigens, for vaccination using, for 
instance, liposomes covered with this GPI-anchored protein. In 

25 addition, this anchored protein can be used in agglutination 
assays, which are admittedly the most economical serological 
test for infectious diseases, especially in less developed 
countries. Besides, a GPI-containing immunogen can elicit an 
immune response modulated by parameters that, although not yet 

30 well-known, can favour the protection (Schofield & Harckett, 
1993, J. Exp. Medicine 177 :145). 

Thus, the present invention represents an alterna- 
tive in the way of expressing recombinant proteins in 
Saccharomyces cerevisiae, with many practical possibilities, 

35 comprising: 

a) the protein will contain a carboxy-terminal 
moiety capable of locating it in the cytoplasma membrane? 

b) said localization will be due to the presence of 
a structure called glycosylphosphatidylinositol (GPI) anchor. 
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the addition of which to the protein is mediated by the 
carboxy-terminal sequence referred to under a); 

c) the carboxy-terminal sequence will be derived 
from the S.cerevisiae Gasl protein; 
5 d) the expressed protein can be any protein compo- 

nent usually found in eurokaryotic or prokaryotic organisms, 
such as hormones, surface antigens, secreted peptides, 
cytoplasmatic proteins etc; 

e) the expressed protein can be human insulin, or 
10 the Mycobacterium leprae 18kDa antigen; 

f) the S.cerevisiae strain host can be one lacking 
the Gasl endogenous protein; 

g) the GPI-anchored protein can be detected by the 
presence of the epitope CRD after treatment with GPI-specific 

15 phospholipase C, or by the alteration of its amphiphilicity 
after treatment with GPI-specific phospholipase; 

h) the GPI-anchored protein can be released specif- 
ically from the insoluble fraction of membranes, from a ex- 
tract of these yeasts, prepared by breaking with glass beads, 

20 by the action of GPI-specific phospholipase. 

The present invention also comprises a method of ob- 
taining yeast strains expressing human insulin or the 
Mycobacterium leprae IBkDa antigen in membrane-anchored form, 
which method comprises: 

25 al) chemical synthesis of the insulin gene, coding 

only for chains B and A, these chains being linked by dibasic 
residues; or 

a2) enzymatic synthesis of the sequence coding for 
the Mycobacterium leprae ISkDa antigen; 
30 b) construction of vectors suitable for expressing 

GPI-anchored proteins in S.cerevisiae; 

c) insertion of the sequences coding for insulin or 
for the IBkDa antigen in these vectors; and 

d) introduction of the plasmids obtained in c) in 
35 yeast strains lacking the endogenous protein Gasl. 

This invention also comprises detecting the pro- 
duction of insulin or of the IBkDa antigen in the cells trans- 
formed with the plasmids, by means of immuno-blots from 
SDS-PAGE or directly from colonies. 
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This invention further comprises detecting the pres- 
ence of GPI anchor in the proteins insulin and 18kD, by means 
of immuno-blots with antibody anti-CRD after treatment of the 
filters with PLC. 
5 Gasl being pratically the only GPI-anchored protein 

present in the yeast cell which is sensitive to hydrolysis by 
GPI-specific phospholipase C, the present invention provides a 
highly selective method of purifying proteins. 

Thus, an object of the present invention is to pro- 
10 vide a process for producing a recombinant protein, or a pre- 
cursor thereof, in cells of genetically modified eukariotic 
microorganisms, characterized by comprising the steps of: 

a) biosynthesizing said protein or a precursor 
thereof by the cell of said microorganism and linking the 

15 endogenous glycosylphosphatidylinositol (GPI) to the C- 
terminal amino acid of the obtained protein or its precursor, 
with the consequent anchoring of said protein or precursor to 
membranes of said microorganism by means of GPI; and 

b) selectively releasing the protein or its precur- 
20 sor obtained in step a) by methods making use of intrinsic 

properties provided by the presence of GPI. 

A specific embodiment of this process is character- 
ized in that said eukaryotic microorganism is a genetically 
modified Saccharomyces cerevisiae. 
25 Other eukaryotic microorganisms that are also con- 

templated herein are Dictyostelium discoideum, trypanosomes 
and other yeasts such as Pichia pastoris and Hansenula 
polymorpha. 

Another specific embodiment of this process is char- 
30 acterized in that said eukaryotic microorganism is 
Saccharomyces cerevisiae genetically modified so as not to 
produce endogenous GPI-anchored proteins. 

Another specific embodiment of this process is char- 
acterized in that said eukaryotic microorganism is 
35 Saccharomyces cerevisiae genetically modified so as not to 
produce the Gasl endogenous protein. 

Another specific embodiment of this process is char- 
acterized in that said recombinant protein is human insulin. 

Another specific embodiment of this process is char- 
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acterized in that said recombinant protein is Mycobacterium 
leprae 18kDa antigen. 

Another specific embodiment of this process is char- 
acterized in that the selective release of the protein or its 
5 precursor obtained in step a) is carried out by enzymatic 
treatment . 

Another specific embodiment of this process is char- 
acterized in that the enzymatic treatment is carried out with 
PI- or GPl-specific phospholipase. 

10 Another specific embodiment of this process is char- 

acterized in that the enzymatic treatment is carried out with 
PI- or GPI-specific phospholipase of C or D specificity. 

In a more specific embodiment of this process r one 
works with PI- or GPI-specific phospholipase C, obtaining 

15 hydrolysed protein or its precursor, which, in its structure, 
presents inositol 1,2-cyclic phosphate (epitope CRD), which is 
susceptible of being selectively purified by immunoaf f inity 
with anti-CRD antibodies. 

Another specific embodiment of this process is char- 

20 acterized in that the enzymatic treatment is carried out with 
proteases • 

Another specific embodiment of this process is char- 
acterized in that the selective release of the protein or its 
precursor obtained in step a) is carried out by chemical 
25 treatment. 

Another specific embodiment of this process is char- 
acterized in that the chemical treatment is carried out by 
nitrous deamination. 

Another specific embodiment is characterized in that 
30 the chemical treatment is carried out with a base. 

Another specific embodiment of this process is char- 
acterized in that the selective release of the protein or its 
precursor obtained in step a) is carried out by treatment with 
neutral detergents. 
35 In another specific embodiment of this process, this 

treatment is carried out with neutral detergents of the n- 
octylglucopy ranoside type . 

Another specific embodiment of this process is char- 
acterized in that the selective release of the protein or its 
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precursor obtained in step a) is carried out by co-expression 
of a PI- or GPI-specific phospholipase in the cell of said 
microorganism. 

It is also an object of this invention to provide a 
5 process for the production of glycosylphosphatidylinositol 
(GPI) and a recombinant protein in cells of genetically modi- 
fied eukaryotic microorganisms, characterized by comprising 
the steps of: 

a) biosynthesizing said protein or a precursor 
10 thereof and biosynthesizing GPI or a precursor thereof by the 

cell of said microorganism, and linking the endogenous 
glycosylphosphatildylinosi tol (GPI) to the C-terminal amino 
acid of the obtained protein or its precursor, with the conse- 
quent anchoring of said protein or its precursor to membranes 
15 of "said microorganism by means of GPI; 

b) selectively releasing the obtained protein or its 
precursor by methods making use of intrinsic properties pro- 
vided by the presence of GPI; and 

c) selectively releasing GPI. 

20 In a specific embodiment of this process, said 

eukaryotic microorganism is a genetically modified 
Saccharomyces cerevisiae. 

Other eukaryotic microorganisms that are also con- 
templated here are Dictyostelium discoideum, trypanosomes and 

25 other yeasts, such as Pichia pastoris and Hansenula 
polymorpha • 

In a specific embodiment of this process, said 
eukariotic microorganism is a Saccharomyces cerevisiae genet- 
ically modified so as not to produce GPI-anchored endogenous 
30 proteins. 

In a specific embodiment of this process, said 
eukaryotic microorganism is a Saccharomyces cerevisiae genet- 
ically modified so as not to produce the endogenous Gasl pro- 
tein. 

35 In a specific embodiment of this process, said 

recombinant protein is human insulin. 

In another specific embodiment of this process, said 
recombinant protein is Mycobacterium leprae ISkDa antigen. 

As a specific embodiment of such process the selec- 
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tive release of a protein or its precursor or GPI or its pre- 
cursor obtained in step a) is obtained by enzymatic treatment. 
This treatment can make use of phospholipases with PI- or 
GPI-specif icity , and specially with PI- or GPI-specific 
5 phospholipases of C or D specificity. After rupture of yeast 
the resulting particulate fraction containing the membranes is 
resuspended in a minimum volume of 0.1% Triton X-114 in 50 mM 
Tris.HCl pH 7.4 with 150 mM NaCl. Then, 1600 U/ml of the 
phospholipase C either from Trypanosoma brucei (GPI-specific) 

10 or from Bacillus thuringiensis (Pl-specif ic) is added and the 
mixture, after gentle mixing, is incubated for 1 hour at 30*C. 
Alternatively, the yeast membranes containing the GPI-anchored 
recombinant protein can be resuspended in 50 mM Tris. acetate 
pH 5.4 and 4000 U/ml of GPI-specific phospholipase D from hu- 

15 man or rat is added and the reaction mixture incubated for 1 
hour at 30**C. At the end of these incubations either the 
supernatant after centr if ugation or the whole incubation mix- 
ture will contain the target protein or the precursor in its 
hydrolysed form which presents in its structure the inositol 

20 1,2-cyclic phosphate (anti-CRD epitope), which can be selec- 
tively purified by immunoaf f inity with anti-CRD antibodies. 
Antibody with this specificity can be obtained from rabbits 
immunized with ILTat 1.21 mf VSG from T. brucei after its con- 
version to its soluble form ILTat 1.21 sVSG by the action of 

25 1600 U/ml of T. brucei phospholipase C in 10 mM Tris.HCl pH 
7.4 containing 0.05% Triton X-114 and 0.05% n- 
octylglucopyranoside for 3 hours at 30**C. New Zealand rabbits 
are immunized with 100 micrograms of ILTaT 1.21 sVSG produced 
as described above, in form of an emulsion with 0.5 ml of 

30 137mM NaCl, 2.7 mM KCl, 4.5mM Na2HP04, 1.5 mM KH2PO4PH 7.4 
(PBS) and 0.5 ml of Freund's complete adjuvant commercially 
available. Subsequent boosts at monthly intervals are made 
with 25 micrograms of ILTaT 1.21 sVSG in 0.5 ml of PBS in 
Freund's incomplete adjuvant. Rabbits are bled exactly 2 weeks 

35 after each boost. The serum from several rabbits is pooled and 
applied to a column in which MITat 1.5 VSG has been 
immobilized on Sepharose CL 4-B (Pharmacia) using standard 
protocols. The antibody population bound to the column is 
eluted with O.IM glycine pH 2.4 and each ml of fraction is 
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collected on ice and immediately neutralized with 90 
microlitres of IM Tris.HCl pH 7.4. The elation of the antibody 
is followed by following optical density at 280 nm, and all 
the fractions with ODs above 0.1 are pooled and applied to a 
5 column where MITat 1.6 mf VSG has been immobilized on 
Sepharose-CL-4B (Pharmacia) using standard protocols. The 
fraction of antibody which does not bind to the column is the 
so called anti-CRD antibody which will react with GPI-anchored 
proteins hydrolysed by phospholipase C but not by 
10 phospholipase D. This antibody is then immobilized on 
Sepharose CL-4B and this resin can be used several times to 
purify, by affinity, phospholipase C-solubilized GPI compo- 
nents. 

The enzymatic treatment for release of GPI-anchored proteins 
15 can also be performed by specific proteases provided that an 
unique site for its action is carefully planned at the design 
of the recombinant protein. In the case of the constructs men- 
tioned in this patent application a trypsin site was inserted 
exactly at the fusion sites. Accessibility of the protease to 
20 the site might depend on folding of each recombinant protein. 
In this case the released products will not contain the CRD 
epitope since the GPI is left inserted in the plasma membrane. 

Still in a further embodiment of this process, the selective 
release of the protein or its precursor or GPI or its precur- 

25 sor obtained in step C is obtained by chemical treatment with 
100 mM NaOH for 1 hour at 37®C in the case of di-acylglycerol- 
based anchors or alternatively by addition of 48% hydrofluoric 
acid. Solvent extraction can be performed by treating each ml 
of membranes with 6.9 ml of a mixture of chloroform and 

30 methanol (1:1), this being an extraction in chloroform : 
methanol : water (10:10:3 v/v/v). After extraction of GPI or 
precursors for 2 hours at room temperature, the insoluble ma- 
terial is centrifuged at 4,000 g for 10 minutes and the 
supernatant dried under a N2 stream. This material is then re- 

35 suspended in 100 microlitres of water-saturated 1-butanol and 
the salts removed by addition of 50 microlitres of water. The 
aqueous phase is again re-extracted with 100 microlitres of 
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water-saturated 1-butanol. The same procedure is still re- 
peated for a third time and the pooled organic phases washed 
twice with 50 microlitres of water-saturated 1-butanol and 
then dried under N2. 
5 A further specific embodiment of this process of re- 

leasing protein or its precursor obtained in the step c) re- 
quires treatment with neutral detergents such as 
n-octylglucopyranoside. In this type of treatment the mem- 
branes containing the product are extracted with 0.5% of the 
10 detergent in PBS and centrifuged at 100,000 g for 1 hour at 

Still another specific embodiment of this process of releasing 
the protein or its precursor obtained in step c) consists of 
co-expressing a PI- or GPI- specific phospholipase in the same 

15 microorganism. 

It is also an object of this invention to provide a 
process for the production of glycosylphosphatidylinositol 
(GPI) in cells of genetically modified eukaryotic 
microorganisms, characterized by comprising the steps of: 

20 a) biosynthesizing a protein or a precursor thereof 

and biosynthesizing GPI or a precursor thereof by the cell of 
said microorganism and possibly linking the endogenous 
glycosylphosphotidylinositol (GPI) to the C-terminal amino 
acid of said protein or its precursor obtained in step a), 

25 with the consequent anchoring of said protein or its precursor 
to membranes of said microorganism by means of GPI; and 

b) selectively releasing GPI. 

In a specific embodiment of this process, said 
eukaryotic microorganism is genetically modified Saccharomyces 
30 cerevisiae. Other eukaryotic microorganisms that are also 
contemplated here are Dictyostelium discoideum, trypanosomes 
and other yeasts, such as Pichia pastoris and Hansenula 
polymorpha. 

In a specific embodiment of this process, said 
35 eukaryotic microorganism is Saccharomyces cerevisiae genet- 
ically modified so as not to produce GPI-anchored endogenous 
proteins. 

In a more specific embodiment of this process, said 
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eukaryotic microorganism is a Saccharomyces cerevisiae genet- 
ically modified so as not to produce the Gasl endogenous pro- 
tein. 

It is also an object of this invention to provide a 
5 process for the obtention of Sacchromyces cerevisiae yeast 
cells capable of expressing a recombinant protein, character- 
ized by comprising the steps of: 

a) providing the gene which codes for said 
recombinant protein or its precursor so as to be anchored via 

10 GPI; 

b) inserting the gene obtained in step a) in an ade- 
quate vector; 

c) introducing the vector containing the gene which 
codes for said recombinant protein or its precursor in a 

15 Saccharomyces cerevisiae cell; and 

d) propagating the cell obtained in step b). 

In a more specific embodiment of this process , said 
recombinant protein is human insulin. 

In another more specific embodiment of this process, 
20 said recombinant protein is Mycobacterium leprae 18kDa. 

In another more specific embodiment of this process, 
the Saccharomyces cerevisiae cell used lacks the capacity of 
producing the Gasl endogenous protein. 

It is also an object of this invention a yeast cell 
25 characterized by being genetically modified so as to express a 
GPI-anchored recombinant protein. 

In a more specific embodiment, this yeast cell is 
Saccharomyces cerevisiae. In still a more specific embod- 
iment, this yeast cell is characterized by not expressing the 
30 Gasl endogenous protein. 

In a further more specific embodiment, this yeast 
cell is characterized by the capacity of raising the levels of 
GPI biosynthesis or of producing GPI with determined proper- 
ties. 

35 In a further more specific embodiment, this yeast 

cell can be characterized by the fact that said recombinant 
protein is human insulin or Mycobacterium leprae 18kDa 
antigen. 

It is also an object of this invention a nucleotide 
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sequence characterized by coding for a recombinant protein or 
its precursor susceptible of being anchored by GPI. 

Also an object of this invention is to provide a 
nucleotide sequence characterized by coding for: 
5 (i) the C-terminal moiety of the Gasl protein of 

Saccharomyces cerevisae; 

(ii) a protein or its precursor; and 

(iii) the N-terminal moiety of the Gasl protein of 
Saccharomyces cerevisiae. 

10 In specific embodiments, this nucleotide sequence 

can be characterized in that said protein is human insulin or 
in that such a protein is Mycobacterium leprae 18kDa antigen. 

It is also an object of this invention a culture me- 
dium characterized by containing cells in accordance with this 

15 invention described in the preceding paragraphs. 

It is a further object of this invention to provide 
a medicament or vaccine characterized by containing a 
recombinant protein obtained by the processes of this in- 
vention described above, or by containing a recombinant pro- 

20 tein whose precursor has been obtained by one of the processes 
of this invention described above. Compositions containing 
recombinant human insulin or Mycobacterium leprae 18kDa 
antigen are specifically contemplated here. 

In order to help in the interpretation of the mean- 

25 ing of the present invention, some terms used in the text are 
described below. 

Coding sequence - A sequence of DNA which, when 
transcribed and translated, results in the formation of a 
polypeptide. 

30 Gene - A region of the genome comprising the coding 

sequence and sequences responsible for the control of its ex- 
pression, that is to say, transcription and translation. 

Secretion signal sequence - A sequence of 
hydrophobic amino acids present in the amino-terminal moiety 

35 of a polypeptide, which has the function of directing this 
polypeptide to the endoplasmic reticulum and, consequently, in 
the absence of any other signal, to the outer environment. 

A Sequence Signalizing the Addition of a GPI Anchor 
- A sequence of hydrophobic amino acids present in the 
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carboxy-terminal moiety of a polypeptide signalizing the 
proteolytic cleavage in a given residue r to which the GPI an- 
chor will be added. 

GPI Anchor - A glycolipid linked to the C-terminal 
5 amino acid of proteins intended for being anchored to the cel- 
lular lipidic membrane. 

In order to construct fusions of a given polypeptide 
and a GPI-anchor domain, DNA coding for the C-terminal 30-50 
residues of a protein that usually contains this anchor is 

10 ligated to DNA coding for the polypeptide in question. This 
fusion is made in the C-terminal moiety of said polypeptide. 
The C-terminal 10 - 20 hydrophobic residues will be processed 
after translation and eliminated from the mature protein. The 
construction of this fusion is accomplished by those skilled 

15 in the art by means of routine techniques. For instance, DNA 
coding for the region signalizing the addition of an anchor 
can be synthesized in vitro or isolated from a genomic DNA or 

CDNA . 

The host for the expression of these anchored prote- 
20 ins is S. cerevisiae especially modified, non-coding for Gasl. 
Gasl is the main or the only GPI-anchored protein which is 
susceptible to the action of GPI-specific phospholipase C. 
Thus, when strains are used from which the gene GASl has been 
deleted, the treatment of cellular extracts of this strain ex- 
25 pressing the specific GPI-anchored protein will release only 
the recombinant product essentially free from other contam- 
inants. 

In order to obtain these products in the mentioned 
hosts, the DNA coding for the product is introduced in these 

30 hosts by several techniques, for instance, transformation, 
electroporation, transfection etc. However, the mere introduc- 
tion of the DNA coding for the product in these cells is not 
enough for them to start synthesizing it. It is necessary for 
this DNA to be capable of being transcribed into a messenger 

35 RNA molecule; for this purpose the DNA has to be coupled to a 
promoter sequence, which is a deoxynucleotide sequence specif- 
ically recognized by the transcription machinery of the cell 
in question. Once this sequence is recognized, the RNA 
polymerase starts to synthesize the messenger RNA using the 
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DNA molecule in question as a template. The promoter sequences 
are specific for each organism. Thus, if the protein in ques- 
tion is produced in S. cerevisiae the coder DNA should be pre- 
ceded by a promoter sequence which is recognized by the S. 
5 cerevisiae RNA polymerase. Once the messenger RNA is synthe- 
sized, it can be translated into a protein molecule. It is 
also important for the translation signals to be present in 
the sequence introduced in the cell. 

The sequence coding for the protein that is to be 

10 obtained can be obtained by several cloning methods known to 
those versed in the art. A DNA segment can be obtained di- 
rectly from the chromosomal DNA, can be synthesized from the 
messenger RNA or can be chemically synthesized. 

This DNA should then be inserted downstream of a 

15 promoter sequence. Several promoters can be used for ex- 
pression in S. cerevisiae and are in the public domain. For 
instance, the promoter pGALl, derived from the gene GALl or 
the promoter ADH2, derived from the gene coding for alcoholic 
dehydrogenase gene can be used. 

20 The type of promoter is chosen depending on the sys- 

tem to be employed or on the process or on the product. There 
are constitutive promoters and, therefore, in these cases, the 
cell will produce the protein in question continuously, which 
can be favourable in cases of co-expression of GPI-specific 

25 phospholipase . In other cases, it is preferred that the cell 
produces the protein in question only when it is programmed 
for this purpose, and in such cases the so-called inducible 
promoters are used, that is to say, promoters that are specif- 
ically activated when their functioning is required, usually 

30 at the end of the fermentation process. 

In order for the organism to produce the protein in 
question, it is not sufficient for its coding sequence to be 
coupled to a promoter and this assembly to be introduced in 
the organism. This assembly has to be maintained stable within 

35 that organism, avoiding its spontaneous loss. For this pur- 
pose, this assembly is incorporated in plasmids, which are 
generally circular DNA molecules, maintained independently in- 
side the cells. Several types of plasmids are known and in the 
public domain. General characteristics of plasmids used as 
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vectors are: a) being capable of autonomously replicating 
within the host; b) having a selectable marker, that is to 
sayr a gene that is essential for the survival of the host. 
Typical selectable genes are: a) for selection in bacteria, 
5 they should preferably impart resistance to antibiotics, for 
instance ampicillin or tetracycline; b) for selection in S. 
cerevisiae, they should complement auxotrophic def f iciencies 
of the host, that is to say, they should contain genes which 
allow the synthesis of essential components absent from the 
10 culture medium. 

In order to simplify the description of the exam- 
ples, certain methods will be mentioned by their routine no- 
menclature. 

"Plasmids" are designated by a "p" followed by capi- 
15 tal letters and/or numbers. The starting plasmids described 
herein are commercially available, are publicly available 
without restrictions, or can be constructed from such avail- 
able plasmids in accordance with published procedures. In ad- 
dition, other equivalent plasmids are known to those skilled 
20 in the art. 

r 

"Digestion" of DNA refers to the catalytic cleavage 
of the DNA with an enzyme that only acts at certain locations 
of the DNA. These enzymes are called restriction enzymes, and 
the specificity sites are called restriction sites. The 

25 enzymes used herein are commercially available, and their re- 
action conditions, co-factors and other requirements are those 
supplied by the manufacturers. Restriction enzymes are desig- 
nated by three letters followed by a number. In general, about 
1 microgram of plasmidial DNA or DNA fragment is used with 

30 about 2 units of the enzyme, in about 20 microliters of buffer 
solution, as specified by the manufacturer. Incubation pro- 
ceeds for about 1 hour at 37**C, or as specified by the man- 
ufacturer. After incubation, protein is extracted with phenol 
and chloroform, and the digested DNA is recovered from the 

35 aqueous phase by precipitation with ethanol. Infrequently the 
plasmidial DNA can be dephosphorylated by treatment with the 
bacterial alkaline phosphatase enzyme, which removes the ter- 
minal 5* phosphates, preventing the circular ization of the 
plasmid during the reaction with the enzyme ligase T4, which 
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would prevent the insertion of another DNA fragment in that 
restriction site. , 

"Filling" or "blunting" refers to the procedure by 
which the cohesive termini left by a restriction enzyme are 
5 filled with deoxynucleotides by the action of the DNA 
polymerase I enzyme (Klenow fragment), becoming blunt. The 
filling of the termini allows the linking of any fragment thus 
treated with another blunt-ended DNA. Typically, the reaction 
consists of the incubation of the target DNA with a buffer ac- 

10 cording to the specification of the manufacturer, in the pres- 
ence of 8 units of DNA polymerase I (Klenow) and 250 microM of 
each of the four deoxinucleotides . The incubation proceeds for 
30 minutes at 37**C, and is followed by extraction with phenol 
and chloroform, and precipitation with ethanol. 

15 "Isolation of DNA fragment" refers to the separation 

of the products of a digestion with a restriction enzyme by 
electrophoresis in agarose gel, identification of the fragment 
of interest by comparing its migration with standards of known 
molecular weights, removal of the gel portion containing the 

20 fragment of interest and separation of the DNA from the gel, 
according to well-known techniques. For instance, see Sambrook 
et al. , 1989) . 

"Ligation" refers to the process of forming a 
phosphodiester bonds between two double-stranded DNA fragments 

25 (Sambrook). The ligation reaction consists of the incubation 
of 0.5 microgram of the two DNA fragments in approximately 
equimolar amounts, in the presence of 10 units of the DNA T4 
ligase enzyme, in the presence of a buffer as specified by the 
manufacturers • 

30 "Transformation" refers to the method of introducing 

DNA in an organism, so that this DNA replicates 
extrachromosomally or integrated in the chromosome. The method 
of transforming E. coll is the one described by Mandel et al., 
1970, J.Mol.Biol. 53,154. The method of transforming S, 

35 cerevisiae is the one described by Ito et al., 1983, J. 
Bacterid. 153; 163. 

"Preparation of DNA of the transf ormants" refers to 
the method of isolating plasmidial DNA from microbial cul- 
tures. The method employed for bacteria can be that of 
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alkaline lysis in the presence of SDS, as described in 
Sambrook et al., 1989, Molecular Cloningr Laboratory Manual. 

"Oligonucleotides" are short polymers of 
deoxynucleotides, which are chemically synthesized by well- 
5 known methods. 

"Western blots" or "immuno-blots" refer to the 
method of identifying proteins separated in SDS-acrylamide gel 
by the use of specific antibodies. The cellular extracts or 
fractions thereof are applied to polyacrylamide denaturating 

10 gel (Laemmli, 1970, Nature 227:680) and subjected to 
electrophoresis. The separated proteins are transferred to 
nitrocellulose filters and subsequently subjected to reaction 
with antiserum or specific monoclonal antibodies. 

"PGR Reaction" refers to the enzymatic amplification 

15 of a double-stranded DNA chain starting from two 
oligonucleotides complementary to the two 5' ends of the tem- 
plate molecule. The DNA to be amplified can be genomic DNA or 
DNA present in a plasmid. The reaction uses the enzyme Tag 
polymerase and is carried out as specified by the manufac- 

20 turer, in a thermal cycle apparatus, for approximately 40 cy- 
cles. 

"Sequencing" of DNA refers to the determination of 
nucleotides present in a DNA chain and can be carried out by 
the enzymatic method with the enzyme Sequenase, as specified 

25 by the manufacturer, and the DNA chains generated are sepa- 
rated in polyacrylamide-urea denaturating gel and visualized 
by autoradiography, as described in Sambrook et al., op.cit.. 

Nucleotides are designated by the standard letters, 
A being adenine, T thymine, C cytosine and G guanine. 

30 Aminoacids are represented by the designations of a 

letter, wherein A is alanine, R is arginine, N is asparagine, 
D is aspartate, C is cysteine, Q is glutamine, E is glutamate, 
G is glycine, H is histidine, I is isoleucine, L is leucine, K 
is lysine, M is methionine, F is phenylalanine, P is proline, 

35 S is serine, T is treonine, W is tryptophan, Y is tyrosine, V 
is valine. 

"GPI extraction" means the recovery of this 
glycoinositolphospholipid by using a series of organic 
solvent/aqueous extractions which selectively separate it from 
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proteins, DNA, lipids and phospholipids. The products of this 
extraction can be analysed using standard processes of thin 
layer chromatography or reversed phase thin layer 
chromatography. Occasionally it is also appropriate to 
5 fractionate the products on an Octyl-Sepharose column 
(Pharmacia), using appropriate gradients of 5-80% of 
1-propanol . 

The scheme and the figures will now be described in 
detail, to which reference will be made in the examples of use 

10 of this invention. 

Figure 1 - Scheme of the Precursor and of the 
GPI-anchored Final Product in S. cerevisiae, and of its Re- 
lease by Phospholipase. Any protein "X" can be coupled to the 
N-terminal region of Gasl and to the C-terminal region of 

15 Gasl. The post-translational processing during transport to 
the endoplasmic reticulum will allow the elimination of the 
N-terrainal sequences of Gasl, through signal peptidase and the 
endopeptidase Kex2, which recognizes and cleaves in dibasic 
residues, and the coupling of the GPI anchor. This 

20 polypeptide will remain anchored to the membrane. Through the 
action of phospholipase C, the anchor will be hydrolysed, and 
the polypeptide can be released from the membrane, with the 
simultaneous creation of the CRD epitope. 

Figures 2A and 2B. Strategy of Construction of Vec- 

2 5 tors for Expression of GPI-anchored Insulin (2A) and 
GPI-anchored 18kDa Protein (2B) in S. cerevisiae. The 
plasmids of the series pBY constructed here are derived from 
yEp352, and are bif unctional , that is to say, they replicate 
and can be selected in both E.coli and S.cerevisiae. The 

30 other plasmids are replicable only in E. coll and have served 
only for the initial obtention, through PGR or from 
oligonucleotides, of the several fragments of DNA that consti- 
tute the examples. The scheme is self-explanatory, the arrows 
refer to the steps of cleavage by restriction enzyme, iso- 

35 lation of DNA fragment and its insertion in another plasmid. 

Figure 3. Sequence of the DNA fragment encoding a 
GPI-anchored insulin (Seq. ID No. 1). The complete nucleotide 
sequence of the construct present on plasmid pBY40, derived 
from GASI and insulin is shown, along with the amino acid se- 
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quence of the expected product. In the 5' end, the GASl- de- 
rived sequences start at nucleotide 15 and end at nucleotide 
^ 380, and in the 3* end, start at nucleotide 560 and end at 
nucleotide 1058. The insulin-coding sequence starts at 
5 nucleotide 393, with a phenylalanine codon, of the B chain, 
and ends at nucleotide 551, with an asparagine codon of the A 
chain. Between the B and A chains of insulin, two basic resi- 
dues were included. The anchor attachment site is at the 
asparagine residue coded at nucletotide position 649. 

10 Figure 4. Sequence of the DNA fragment encoding a 

GPI-anchored IBkDa protein (Seq. ID No. 2). The complete 
nucleotide sequence of the construct present on plasmid pBY48, 
derived from GASl and the gene encoding 18kDa, is shown. The 
GASl 5' sequence is identical to that present on plasmid 

15 pBY40, shown in Figure 3. From nucleotide 380 to nucleotide 
830, the sequence is derived from the gene encoding the IBkDa 
protein. The remaining sequence is derived from the 3* end of 
the GASl gene and is identical to that shown in Figure 3. 

Figure 5 - Expression of GPI-anchored Human Insulin 

20 in Yeast. Colonies of yeast bearing the plasmid pBY40, which 
codes for GPI-anchored insulin, were grown in a solid medium 
and transferred, in duplicate, to nitrocellulose filters. As a 
control, colonies of yeast bearing only the vector yEP352 
(here called empty RH273-1A) and colonies bearing a plasmid 

25 coding for nomally anchored protein Gasl were also included. 
The filters were treated with sodium hydroxide in order to 
permeabilize the wall of the yeast. In A), the filter was 
treated with T. brucei phospholipase C. In B), the filter was 
not treated. This was followed by incubation with anti-CRD 

30 antibody and visualization of the reaction with protein A 
marked with iodine-125 after autoradiography. 

Figure 6 - Expression of the 18kDa Protein in An- 
chored Form. Extracts of yeast cells bearing the plasmid 
pBY48 were prepared by stirring with glass beads. The total 

35 extract was subjected to electrophoresis in polyacrylamide 
gel, in duplicate, and then to immuno-blot. The filter shown 
in A) was incubated with monoclonal antibody against the ISkDa 
protein. In B), the filter was treated with T. brucei 
phospholipase C and incubated with anti-CRD antibody. In fil- 
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ter A), line 1, purified 18kDa protein was applied; in line 2, 
cellular extract treated with endoglycosidase H; in line 3/ 
extracts of cells lacking the plasmid, treated with 
endoglycosidase H; in lines 4 and 5, extracts as in 2 and 3 
5 but without the treatment with endoglycosidase H. In filter 
B), line 1 contains T. brucei mfVSG; line 2, extract of empty 
strain; line 3, extract of strain bearing plasmid with the 
gene GASl; line 4, extract of empty strain; line 5/ extract of 
strain expressing anchored ISkDa protein (arrow). 

10 Figure 7 - Solubilization of Anchored Protein by 

Treatment with phospholipase C. The solubilization can be 
monitored by phase separation in Triton X-114, followed by the 
detection of the product with anti-CRD, A sample of extract 
of wild type S. cerevisiae was treated with B. thuringiensis 

15 PLC (lanes 1) or T. brucei PLC (lanes 2) at 30**C for 30 min- 
utes. At the end of the reaction, the concentration of TX-114 
was increased to 2% and phase separation was induced by rais- 
ing the temperature. Duplicates of each phase were subjected 
to SDS-PAGE, one of them being treated with anti-Gasl serum 

20 (Panel A) and the other with anti-CRD (Panel B). The detection 
of the bound antibody was made with protein A marked with 
iodine-125, followed by autoradiography. The detergent and 
aqueous phases are indicated by D and A, respectively. 

The following examples are intended merely to illus- 

25 trate specific modes of carrying out the present invention and 
should not be construed as limiting its scope. 

Preliminary Comments on the Materials, Strains and 
Plasmids Reagents for DNA modification and restriction were 
acquired from Gibco-BRL or BioLabs. Other reagents, salts and 

30 materials were acquired from Sigma, Gibco-BRL, Amersham or 
other similar ones. Media for cultivation of microorganisms 
were acquired from Difco Laboratories. All the other chemical 
compounds used are of analytical grade. £• coli strains used 
for the propagation and amplification of the plasmids de- 

35 scribed herein were DH5 (Hanahan, 1983, J. Mol. Biol. 
166:557). The S. cerevisiae strain can be RH273-1A, as de- 
scribed in Nuoffer et al; 1991, Mol. Cell. Biol. 11:27. Tech- 
niques for handling S. cerevisiae, as well as culture media 
for growing it, and preparation of cellular extracts are de- 
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scribed in Guide to Yeast Genetics and Molecular Biology, Eds. 
Guthrie & Fink, Meth. Enzymol. 194. The plasmids described 
below were obtained in E. coli and, after checking their 
structures by known techniques, they were transferred to S. 
5 cerevisiae. 

Example 1 - Construction of the Vector for Ex- 
pression of Anchored Proteins. Any plasmid capable of being 
maintained stable in S. cerevisiae cells and having a marker 
for selection in these cells can be used as a basic vector, as 

10 for instance, those containing the gene LEU2 or URA3 for 
complementation of auxotrophic markers, and containing se- 
quences capable of mediating their replication, as derived 
from "ars" (autonomous replicating sequences) or from the 
endogenous plasmid 2 micron. They can still contain 

15 centromeric sequences that allow the plasmid to be maintained 
in a low number of copies, in a relatively stable manner. Here 
one prefers to use the plasmid YEP352 (Hill et al., 1986, 
Yeast 2:163), which contains the gene DRA3, and a replication 
sequence derived from 2 micron so as to mantain it in a high 

20 number of copies and, thus, to increase the amount of the 
recombinant product. In this plasmid a fragment of DNA was 
inserted which was obtained by a PCR reaction from the cloned 
GASl gene (Nuoffer et al., 1991, Mol. Cell. Biol. 11:27). 
This fragment will contain the promoter as well as the amino- 

25 terminal sequence of GASl, which will allow the transfer of 
the recombinant polypeptide to the endoplasmic reticulum. 
This fragment can be obtained through PCR by using, as prim- 
ers, oligonucleotides complementary, for example, to the re- 
gion located around the position -250 with respect to the 

30 first codon, and to the region located around the position 
+100 with respect to the GASl first codon. In this way, one 
ensures the presence, in this segment, of the GASl promoter 
and of the sequence which signals the transport to the 
endoplasmic reticulum. In this example two primers with the 

35 sequences 5 • -TTTCCCGGGT-ATTCCTCATACAGC-3 • and 

5'-ACGGGATCCGTTGGAGTAGAAAAACT-3 * were used to obtain a frag- 
ment containing the promoter and amino-terminal sequence of 
the GASl gene, by PCR-amplif ication of the chromosomal DNA. 
The 365 nucleotide fragment obtained contain sequences from 
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248 nucleotides upstream of the start site of translation of 
the GASl gene up to nucleotide +117 from the starting ATG. The 
PGR product was blunt-ended cloned into M13, and one clone was 
chosen for sequencing to determine the correctness of the se- 
5 guence and for further use. Preferably, this fragment can be 
inserted in the vector yEp352 so as to contain, downstream of 
the GASl sequence, a polylinker in order to allow the easy 
cloning of genes coding for proteins the expression of which 
is desired. It is convenient to sequence the obtained clone 

10 obtained in order to be sure that the sequence is correct. In 
this plasmid one can still insert a DNA fragment derived from 
the GASl gene by amplification through PGR, coding for the 
Gasl protein C terminal sequence and further containing the 
signal of termination of transcription of the same gene. This 

15 fragment can be obtained, for instance, by using a pair of 
primers complementary to the region close to nucleotide -180 
with respect to the GASl-terminating codon, and to the region 
close to nucleotide +300 with respect to the GASl-terminating 
codon. In this way, the presence, in this fragment, of the 

20 sequence which determines the anchor addition, as well as of 
sequences of transcription termination and of poly-A tail ad- 
dition in the message, is ensured. In this example, the 
carboxyl sequence of the GASl gene, containing the anchor ad- 
dition site, was obtained by PGR amplification with the 

25 oligonucleotides 5 • -ACGGTCGACTCTTCTTCCAAGTCTAA-3 * and 

5'-CCCCAAGCTTGCTGATATTATGGAGAA-3' . The product was blunt-ended 
cloned initially into BlueScript, for sequencing. The ampli- 
fied 495 nucleotide fragment encodes the last 61 amino acids 
of GASl, plus 313 nucleotides of the 3' non-coding region of 

30 GASl* The insertion of a fragment coding for any protein "X" 
whatever between these two Gasl segments will allow the syn- 
thesis of a fusion protein, which after post-translational 
processings will be anchored to the membrane through a GPI an- 
chor (Figure 1) . 

35 Example 2 - Construction of the Vector for £x- 

pression of Anchored Insulin. In the plasmid constructed as 
described above or alternatively as described in Figure 2A, 
one can insert a chemically or enzymatically synthesized DNA 
fragment coding for human insulin. For instance, one can ob- 
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tain a DNA fragment coding for the B and A chains, in this or- 
der, and insert, between the amino acids which separate them 
in the active form, two basic residues which will allow its 
processing inside the endoplasmic reticulum, or later, after 
5 purification of the product. The insulin-encoding sequence, 
which can be easily derived according to the codon usage fre- 
quency for yeast, starting from the known amino acid sequence, 
can bie obtained in several ways, as for instance, through the 
synthesis of two oligonucleotides so that the nucleotides in 

10 the 3' region of one will be complementary to the 3* sequence 
of the other, in an approximate extent, for instance, of about 
20 residues. After annealing, the treatment is effected with a 
DNA polymerase in order to obtain a double-stranded DNA, which 
can then be ligated to the previously described vector, in a 

15 site suitable for obtaining a fusion with the amino- and 
carboxy-terminal moieties of Gasl. 

In this particular example, the insulin-coding se- 
quence was obtained by the use of two oligonucleotides with 
the following sequences: 

20 5 • -ACGGGATCCAAGAGATTTGTTAACCAACACTTGTGTGGTTCTCACTTGGTTGAAGCCC 
TGTACTTGGTTTGTGGTGAAAGAGGTTTCTTCTACACTCCGAAG-3 ' and 
5 • -CCGGTCGACTCTGTTACAGTAGTTTTCCAGCTGGTACAAAGAACAAATACTAGTACA 
ACATTGTTCAACAATACCTCTCTTAGTCTTCGGAGTGTAGAAGAAAC- 3 * . After 
annealing, and polymerization with Klenow, the double-stranded 

25 fragment was blunt-end cloned into M13, for sequencing. One 
particular clone |M13{T2.1)m| was kept for further use. The 
insulin sequence was then fused to the GASl-derived sequences, 
as shown in Figure 2A. Briefly, the sequence derived from the 
GASl gene encompassing the anchor addition signal, obtained by 
PGR as described previously, was isolated as a Sall-Hindlll 
fragment and inserted in plasmid YEp352, digested with the 

30 identical enzymes. This originated plasmid pBYlQ. A BamHI-Sall 
fragment coding for insulin present in M13{T2.1), obtained as 
described before, was inserted in pBY19 plasmid, after di- 
gestion of the latter with the same enzymes, giving rise to 
plasmid pBY36. A BamHI fragment, derived from plasmid pN#3, 

35 containing the promoter and signal sequence of GASl, which was 
obtained by PGR amplification, was then inserted in the BamHI 
site of plasmid pBY36. The resulting plasmid, pBY40, contains 
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the necessary information for the synthesis of a GPI-anchored 
insulin, in yeast. The DNA sequence of this fusioii and the 
amino acid sequence of the resulting protein are given in Fig- 
ure 3 (Seq. ID N** 1). 
5 Example 3 - Construction of the Vector for Ex- 

pression of Anchored 18kD Protein. A DNA fragment coding for 
the M. leprae 18 kDa antigen can be obtained by amplification 
through PGR having as template the cloned gene (Booth et al., 
1988, J. Immunol. 140 : 597). The optically synthesized frag- 

10 ment will contain restriction sites for ligation to the vector 
constructed as described in Example 1 (Figure 2B), so as to 
allow the correct phase fusion of this sequence to the se- 
quences derived from GASl. 

In this particular example, the sequence encoding 

15 the 18kDa protein of M. leprae was obtained by PGR amplifi- 
cation using as primers the oligonucleotides 
5 • -TGGTCTAGAGGTACTGACCCGTTCCG-3 ' and 
5 ' -TGCTCTAGAGGCATCTATGATTTGGT-3 ' , and as template the cloned 
gene for the 18kDa protein^ present in plasmid pUL118. This 

20 amplified fragment, containing the entire coding sequence for 
this protein, was cloned in M13 for sequencing and further 
use. A BamHl fragment from plasmid pN#3, encompassing the 
promoter and signal sequence of GASl, was inserted in the 
BamHI site of plasmid YEp352, originating plasmid pBY32. A 

25 Xbal fragment containing the coding sequences for the 18kDa 
protein, present on M13(18K)#5, obtained as described before, 
was transferred to the Xbal site of plasmid pBY32, giving rise 
to plasmid pBY43. A Sall-Hindlll fragment containing the GASl 
C-terminal fragment, present on plasmid pC(Bl), was ligated to 

30 the Sall-Hindlll sites of plasmid pBY43, forming plasmid 
pBY48, which is capable of encoding a GPI-anchored 18kD pro- 
tein in yeast. The DNA sequence of this construct and the cor- 
responding amino acid sequence of the fusion are given in 
Figure 4 (Seq. ID 2) . 

35 Example 4 - Obtention of Yeast Transformed with the 

above Plasmids. The plasmids obtained in Examples 2 and 3 can 
be introduced in strains of yeast by conventional methods, 
such as transformation by lithium acetate or transformation of 
spheroplasts (Guide to Yeast Genetics and Molecular Biology, 
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Guthrie & Fink, Eds., 1991, Academic Press, Inc.)* strain 
to be used can be preferably RH273-1A, since it lacks the Gasl 
protein. Transf ormants should be selected in a minimum medium 
through selection by complementation of auxotrophic markers, 
5 in this case growth in the absence of uracil. 

In this example, the strain constructed to express 
GPI-anchored insulin is called RH273-lA/pBY40 , and the strain 
constructed to express GPI-anchored ISkDa protein is called 
RH273"lA/pBY48. 

10 Example 5 - Detection of the Expression of 

GPI-Anchored Insulin. Strain RH273-lA/pBY40 , constructed as 
described in Example 4, can be grown in isolated colonies, in- 
itially in a selective medium, and thereafter replicated to a 
rich medium. The cells can be transferred to nitrocellulose 

15 membranes, lysed, treated with phospholipase C obtained from 
Trypanosoma brucei, and with anti-CRD antiibody. The presence 
of CRD epitope can be visualized through the incubation of 
this filter with protein A coupled to iodine-125, as described 
in Cardoso de Almeida & Turner, 1983, op.cit. (figure 5). 

20 Example 6 - Detection of the Expression of 

GPI-Anchored IBkDa protein. Similarly as described in Example 
5, the expression of anchored 18 kOa protein in RH273-lA/pBY48 
cells can be detected by using the anti-CRD antibody. Alterna- 
tively, the anchored 18 kDa protein can be easily visualized 

25 in immuno-blots from SDA-PAGE of cellular extracts (figure 6). 
Both monoclonal L5 and anti-CRD antibodies recognize the 
recombinant product, which is highly glycosylated, as shown by 
the high molecular weight of the product. The glycosylation 
occurs in residues derived from Gasl, and can be eliminated by 

30 the construction of a fusion wherein the residues susceptible 
of glycosylation are removed. 

Example 7 - Release with GPI-specific phospholipase. 
The GPI-anchored proteins can be solubilized by treating the 
cells or cellular extracts with phospholipase specific for GPI 

35 structures, as shown in figure 7. 

Example 8. Utilization of GPI as a vaccine or immune 
system modulator. The GPI moieties produced by engineered 
organism can be purified and analysed according to standard 
procedures such as solvent selective extraction and 
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fractionation by thin layer chromatography or reverse phase 
thin layer chromatography or by cromatography on Octyl- 
Sepharose. These glycolipids, or part of them^ chemically or 
enzymatically treated, are then prepared with appropriate 
5 adjuvants and administered as vaccines or immune system 
modulators. Their effects can be followed by classical methods 
of humoral and cellular immunology such as titrating levels of 

4 

antibodies with certain specificities or by analysing the pro- 
file of lymphocytes which can be stimulated by the injected 
10 antigen. 

The sequences shown in figures 3 and 4, Seg. ID No. 
1 and Seq. ID No. 2, without however the amino acid indi- 
cation, are also submitted separately both in printed and in 
computer readable form. 
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CLAIMS 

1. A process for producing a recombinant protein, or 
a precursor thereof, in cells of genetically modified 
eukaryotic microorganisms, characterized by comprising the 
5 steps of: 

a) biosynthesizing said protein or a precursor 
thereof by the cell of said microorganism and linking the 
endogenous glycosylphosphatidylinositol (GPI) to the C- 
terminal amino acid of the obtained protein or precursor, 

10 with the consequent anchorage of said protein or precursor to 
membranes of said microorganism by means of GPI; and 

b) selectively releasing said protein or precursor, 
obtained in step (a), by methods making use of intrinsic prop- 
erties provided by the presence of GPI. 

15 2. A process according to claim 1, characterized in 

that said eukaryotic microorganism is a genetically modified 
Saccharomyces cerevisiae. 

3. A process according to claim 2, characterized in 
that said eukaryotic microorganism is a Saccharomyces 

20 cerevisiae, genetically modified so as not to produce 
GPI-anchored endogenous proteins. 

4. A process according to claim 2, characterized in 
that said eukaryotic microorganism is Saccharomyces 
cerevisiae, genetically modified so as not to produce the 

25 endogenous protein Gasl. 

5. A process according to any one of claims 1 to 4, 
characterized in that said recombinant protein is human 
insulin. 

6. Process according to any one of claims 1 to 4, 
30 characterized in that said recombinant protein is 

Mycobacterium leprae antigen 18kDa. 

7. A process according to any one of claims 1 to 6, 
characterized in that the selective release of the protein or 
its precursor obtained in step a) is carried out by enzymatic 

35 treatment. 

8. A process according to claim 7, characterized in 
that the treatment is carried out with PI- or GPI-specific 
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phospholipase . 

9* A process according to claim 8, characterized in 
that the treatment is carried out with PI- or GPI-specific 
phospholipase of C or D specificity. 
5 10. A process according to claim 7, characterized in 

that the treatment is carried out with proteases. 

11. A process according to any one of claim 1 to 6, 
characterized in that the selective release of the protein or 
its precursor obtained in step a) is carried out by chemical 

10 treatment. 

12. A process according to claim 11, characterized 
in that the treatment is carried out by nitrous dearaination. 

13. A process according to claim 11, characterized 
in that the treatment is carried out with a base. 

15 14. A process according to any one of claims 1 to 6, 

characterized in that the selective release of the protein or 
its precursor obtained in step a) is carried out by treatment 
with neutral detergents. 

15. A process according to claim 14, characterized 
20 in that the treatment is carried out with neutral detergents 

of the type n-octyl glucopyranoside. 

16. A process according to any of claims 1 to 6, 
characterized in that the selective release of the protein or 
its precursor obtained in step a) is carried out by co- 

25 expression of a PI- or GPI-specific phospholipase in the cell 
of said microorganism. 

17. A process according to claim 9, characterized in 
that one works with a PI- or GPI-specific phospholipase C, 
thereby obtaining hydrolysed protein or its precursor, which 

30 has in its structure cyclic 1, 2-phosphate inositol (epitope 
CRD), which is susceptible of being selectively purified by 
immunoaf inity with anti-CRD antibodies. 

18. A process for the production of 
glycosylphosphatidylinositol (GPI) and of a recombinant pro- 

35 tein in cells of genetically modified eukaryotic 
microorganisms, characterized by comprising the steps of: 

a) biosynthesising said protein or a precursor 
thereof by the cell of said microorganism and linking the 
endogenous glycosylphosphatidylinositol (GPI) to the C- 
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terminal amino acid of the obtained protein or precursor, with 
the consequent anchorage of said protein or precursor to mem- 
branes of said microorganism by means of GPI; 

b) selectively releasing said protein or precursor , 
5 obtained in step (a), by methods making use of intrinsic prop- 
erties provided by the presence of GPI, and 

c) selectively releasing GPI. 

19. A process according to claim 18, character issed 
in that said eukaryotic microorganism is a genetically modi- 

10 fied Saccharomyces cerevisiae. 

20* A process according to claim 19, characterized 
in that said eukaryotic microorganism is a Saccharomyces 
cerevisiae, genetically modified so as not to produce 
endogenous GPI-anchored proteins. 

15 21. A process according to claim 19, characterized 

in that said eukaryotic microorganism is a Sacchromyces 
rerevisiae, genetically modified so as not to produce the 
endogenous protein Gasl. 

22. A process according to any one of claim 18 to 
20 21, characterized in that said recombinant protein is human 

insulin. 

23. A process according to any one of claims 18 to 
21, characterized in that said recombinant protein is 
Mycobacterium leprae antigen 18kDa. 

25 24. A process according to any one of claims 18 to 

23, characterized in that the selective release of the protein 
or its precursor or GPI or of its precursor obtained in step 
a) is carried out by enzymatic treatment. 

25. A process according to claim 24, characterized 
30 in that the treatment is carried out with PI- or GPI-specific 

phospholipase . 

26. A process according to claim 25, characterized 
in,. that the treatment is carried out with PI- or GPI-specific 
phospholipase of C or D specificity. 

35 27. A process according to claim 24, characterized 

in that the treatment is carried out with proteases. 

28. A process according to any one of claims 18 to 
23, characterized in that the selective release of the protein 
or its precursor or of GPI or its precursor obtained in step 
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a) is carried out by chemical treatment. 

29. A process according to claim 28, characterized 
in that the treatment comprises extractions with organic sol- 
vents. 

5 30. A process according to claim 28, characterized 

in that the treatment is carried out with a base. 

31. A process according to any one of claims 18 to 
23, characterized in that the selective release of the protein 
or its precursor obtained in step a) is carried out by treat- 

10 ment with neutral detergents. 

32. A process according to claim 31, characterized 
in that the treatment is carried out with neutral detergents 
of the type n-octyl glucopyranoside. 

33. A process according to any one of claims 18 to 
15 23, characterized in that the selective release of the protein 

or its precursor obtained in step a) is carried out by co- 
expression of a PI- or GPI-specific phospholipase in the cell 
of said microorganism. 

34. A process according to claim 26, characterized 
20 in that one works with PI- or GPI-specific phospholipase C, 

thereby obtaining hydrolysed protein or its precursor, which 
has in its structure cyclic 1, 2-phosphate inositol (epitope 
CRD), which is susceptible of being selectively purified by 
immunoaf inity with anti-CRD antibodies. 
25 35. A process for the production of 

glycosylphosphatidylinositol (GPI) in cells of a genetically 
modified Saccharomyces cerevisiae characterized by comprising 
the steps of: 

a) biosynthesis of a protein or a precursor thereof 
30 and biosynthesis of GPI or a precursor thereof by said cells ; 

and 

b) selective release of GPI. 

36. A process according to claim 36, characterized 
in that said Saccharomyces cerevisiae is one genetically modi- 

35 fied so as not to produce the endogenous protein Gasl. 

37. A process for the obtention of cells of 
Saccharomyces cerevisiae yeasts capable of expressing a 
recombinant protein, characterized by comprising the steps of: 

a) providing the gene coding for said recombinant 
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protein or its precursor so as to be anchored via GPI; 

b) inserting the gene, obtained in step a), in an 
suitable vector? 

c) introducing the vector containing the gene coding 
5 for said recombinant protein or its precursor in a 

Sacchromyces cerevisiae cell; and 

d) propagating the cell obtained in step c) * 

38. A process according to claim 31, characterized 
in that said recombinant protein is human insulin. 
10 39. A process according to claim 37 f characterized 

in that said recombinant protein is Mycobacterium leprae ISkDa 
antigen. 

40. A process according to claim 37 , characterized 
in that the Saccharomyces cerevisiae cell used lacks the capa- 

15 bility of producing the endogenous protein Gasl. 

41. A yeast cell, characterized by being genetically 
modified so as to express a GPI-anchored recombinant protein. 

42. A cell according to claim 41, characterized by 
being Saccharomyces cerevisiae. 

20 43. A cell according to claim 42, characterized by 

not expressing the endogenous protein Gasl. 

44. A cell according to any one of claims 40 to 43, 
characterized in that said recombinant protein is human 
insulin. 

25 45. A cell according to any one of claims 40 to 43, 

characterized in that said recombinant protein is 
Micobacterium leprae 18kDa antigen. 

46. A nucleotide sequence, characterized by coding 
for a recombinant protein or its precursor, susceptible of be- 

30 ing anchored by GPI. 

47. A nucleotide sequence, characterized by coding 

for : 

i) the C-terminal moiety of the protein Gasl of 
Saccharomyces cerevisiae; 
35 (ii) a protein or a precursor thereof, and 

(iii) the N-terminal moiety of the protein Gasl of 
Saccharomyces cerevisiae • 

48. A nucleotide sequence according to claim 47, 
characterized in that said protein is human insulin. 
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49. A nucleotide sequence according to claim 47, 
characterized in that said protein is Mycobacterium leprae 
18kDa antigen. 

50. A culture medium, characterized by containing 
5 cells according to any one of claim s 41 to 43. 

51. A medicine or vaccine, characterized by contain- 
ing a recombinant protein obtained by a process according to 
any one of claims 1 to 34 or a recombinant protein of which 
the precursor was obtained by a process according to any one 

10 of claims 1 to 34. 

52. A medicine according to claim 51, characterized 
by containing recombinant human insulin. 

53. A vaccine according to claim 51, characterized 
by containing Mycobacterium leprae 18kDa antigen. 

15 54. A medicine or vaccine characterized by contain- 

ing a GPI obtained by a process according to any one of claims 
35 to 40 or a GPI of which the precursor was obtained by a 
process according to any one of claims 35 to 40. 

55. A product obtainable by a process according to 
20 any one of claims 1 to 40. 

56. A process according to claim 35, characterized 
in that step a) further comprises linking the endogenous 
glycosylphosphatydylinositol (GPI) to the C-terminal amino 
acid of the obtained protein or its precursor, with the conse- 

25 guent anchoring of said protein or precursor to membranes of 
said microorganism by means of GPI. 

57. A cell according to any one of claims 41-45, 
characterized by the capacity of increasing the levels of GPI 
biosynthesis or of producing GPI with determined properties. 

30 58. Method for detecting the presence of a GPI an- 

chor in insulin or in 18kDa protein, by means of immuno-blots 
with antibody anti-CRD after treatment of the filters with 
PLC. 
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FIGURE 1 . Scheme of the precursor and of the GP I - anchored final 
product in S. cerevisiae , and of its release with phospholipase 
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FIGURE 2A. 




wo 95/22614 



3/10 



PCT/BR9S/00010 



FIGURE 2B. 
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FIGURE 3 



ANCHORED INSULIN (pBY40) SEQUENCE 



GGATCCCCTTTCCCGGGTATTCCTCATACAGCCTGCGCGGTTTATTAGTAAAATACCCGA 
10 20 30 40 50 60 



TAATCCTCGAGGTTTGAAAAACTTTTCCCTCTACTACTGTTGACACGGAT'iTiTrTATTT 
70 80 90 100 110 120 



AAGAGGAAAAGTCGTGGTTGTTTTCCTCGAACAAATTAGATATCCATAAATAGTTGTGTC 
130 140 150 160 170 180 



GTTTTATTAAGCTATTTCAAAATCAGTTTTTATTTTTTAAAGTCTGATAAAACAAAAACA 
190 200 210 220 230 240 



MLFKSLSKLATA 
ACAAACACAGCTAAATCTCAACAATGTTGTTTAAATCCCTTTCAAAGTTAGCAACCGCTG 
250 260 270 280 290 300 



AAFFAGVATADDVPAIEVVG 
CTGCTTTTTTTGCTGGCGTCGCAACTGCGGACGATGTTCCAGCGATTGAAGTTGTTGGTA 
310 320 330 340 350 360 



NKFFYSNGSKRFVNQHLCGS 
ATAAGTTTTTCTACTCCAACGGATCCAAGAGATTTGTTAACCAACACTTGTC 

370 380 390 400 410 420 



HLVEALYLVCGERGFFYTPK 
ACTTGGTTGAAGCCCTGTACTTGGTTTGTGGTGAAAGAGGTTTCTTCTACACTCCGAAGA 
430 440 450 460 470 480 



TKRGIVEQCCTSICSLYQLE 
CTAAGAGAGGTATTGTTGAACAATGTTGTACTAGTATTTGTTCTTTGTACCAGCTGC^^ 
490 500 510 520 530 540 



NYCNRVDSSSKSNSGSSGSS 
ACTACTGTAACAGAGTCGACTCTTCTTCCAAGTCTAACTCCGGCTCTTCTGGTTCTTCCA 
550 560 570 580 590 600 



SSSSSSSASSSSSSKKNAAT 
GTTCTTCTTCTTCTTCTTCAGCTTCATCTTCATCTTCTAGCAAGAAGAATGC^^ 

610 620 630 640 650 660 
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FIGURE 3 (cont) 



^..^ KANLAQVVFTSIISLSIA 

ACGTTAAAGCTAACTTAGCACAAGTGGTCTTTACCTCCATCATTTCCTTATCCATTCK^ 

^■70 680 690 700 710 720 

A G V G F A L V * 

CTGGTGTCGGTTTTGCTTTGGTTTAAAAAGCTagctTCGACACATACAT^^ 

740 750 760 770 780 

AAGGTATTCTATCTTATTTCATTGTGGGGTAGTTTTTACGAAAAAAATGA^^ 

800 810 820 830 840 

GTATAGTATATATTTTTTTTCTATGTAAGTTTTATAAGATTCT^ 

860 870 880 890 900 

GTAAATTAAAAAGAACACTATTGTTACATTATATGTTTTTAAATCATCAAAAAAGAC^ 

920 930 940 950 960 

ATTCATTTAATAOTCCTTATAGAACTACTTAACATTGTTCTTCTTTC 

970 980 990 1000 1010 1020 

TATGCAAACCATTTATGTACTTTCTCCATAATATCAGCAAGCTT 
1030 1040 1050 1060 
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FIGURE 4 



ANCHORED 18kD (pBY48/49) SEQUENCE 



GGATCCCCTTTCCCGGGTATTCCTCATACAGCCTGCGCGGTTTATTAGTAAAATACCCGA 
10 20 30 40 50 60 



TAATCCTCGAGGTTTGAAAAACTTTTCCCTCTACTACTGTTGACACGGATTTTTTO 

70 80 90 100 110 120 



AAGAGGAAAAGTCGTGGTTGTTTTCCTCGAACAAATTAGATATCCATAAATAGTTGTGTC 
130 140 150 160 170 180 



GTTTTATTAAGCTATTTCAAAATCAGTTTTTATTTTTTAAAGTCTGATAAAACAAAi^ 

190 200 210 220 230 240 



MLFKSLSKLATA 
ACAAACACAGCTAAATCTCAACAATGTTGTTTAAATCCCTTTCAAAGTTAGCAACCGCTG 
250 260 270 280 290 300 



A A F F AGVATADDVPAIEVVG 
CTGCTTTTTTTGCTGGCGTCGCAACTGCGGACGATGTTCCAGCGATTGAAGOTGTTGCT 
310 320 330 340 350 360 



N K F F YSNGSSRRTDPFRELD 
ATAAGTTTTTCTACTCCAACGGATCCTCTAGACGTACTGACCCGTTCCGTGAACTGGACC 
370 380 390 400 410 420 



RFAEQVLGTSARPAVMPMDA 
GCTTCGCCGAGCAAGTGTTAGGTACGTCTGCCCGCCCAGCAGTAATGCCCATGGACGCTT 
430 440 450 460 470 480 



WREGEE FVVEFDLPGI KADS 
GGCGTGAGGGCGAAGAATTCGTCGTCGAGTTCGACCTTCCTGGCATCAAAGCCGAT^ 

490 500 510 520 530 540 



LDIDIERNVVTVRAERPGVD 
TGGACATTGACATCGAACGCAACGTAGTCACCGTGCGGGCCGAGCGCCCAGGCGTCGACC 
550 560 570 580 590 600 



PDREMLAAERPRGVFNRQLV 
CCGATCGGGAAATGCTTGCTGCCGAGCGGCCACGCGGTGTGTTCAATCGTCAGCTGGTTC 
610 620 630 640 650 660 
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FIGURE 4(cont) 

LGENLDTERILASYQEGVLK 
TCGGCGAAAACCTCGACACCGAACGGATCTTGGCCTCCTACCAAGAAGGTGTCCTGAAGT 
670 680 690 700 710 720 

LSIPVAERAKPRKISVDRGN 
TGTCGATACCAGTAGCCGAAAGGGCTAAACCGCGCAAGATCTCCGTTGATCGTGGCAACA 
730 740 750 760 770 780 

NGHQTINKTAHEI IDASRVD 
ACGGACACCAGACCATAAACAAAACCGCACACGAAATCATAGATGCCTCTAGAGTCGACT 
790 800 810 820 830 840 

SSSK5HSGSSGSSSSSSSSS 
CTTCrrCCAAGTCTAACTCCGGCTCTTCTGGTTCTTCCAGTTCTTCTTCTTCTTCTT^ 
850 860 870 880 890 900 

ASSSSSSKKNAATNVKAMLA 
CTTCATCTTCATCTTCTAGCAAGAAGAATGCTGCCACCAACGTTAAAGCTAACTTAGCAC 
910 920 930 940 950 960 

QVVFTS I ISLSIAAGVGFAL 
AAGTGGTCTTTACCTCCATCATTTCCTTATCCATTGCCGCTGGTGTCGGTTTTGCTTTGG 
970 980 990 1000 1010 1020 

V ♦ 

TTTAAAAAGCTagctTCGACACATACATAATAACTCGATAAGGTATGGTATCTTATTTCA 
1030 1040 1050 1060 1070 1080 

TTGTGQGGTAGr iTl TACGAAAAAAATGAAAAGTTGTAAGTATAGTATATAl"i'lUTl"^^^ 
1090 1100 1110 1120 1130 1140 

TATCTAAGTTTTATAAGATTCTATOCGCTATTACCACCGGTAAATTAAA^ 

1150 1160 1170 1180 1190 1200 

TGTTACATTATATGTTTTTAAATCATCAAAAAAGACAATATTCATTTAATATTCCTTA^^ 
1210 1220 1230 1240 1250 1260 

GAACTACTTAACATTGTTCTTCTTTCTATTAAACGTCTTTATGCAAACCATTTATGTACT 
1270 1280 1290 1300 1310 1320 

TTCTCCATAATATCAGCAAGCTT 
1330 1340 
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FIGURE 5. Expression of GPI - anchored human insulin in yeast 
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FIGURE 7. Solubilization of ancliored protein by treatment with 
phosphoiipase C. 
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